Planning with Pixels in (Almost) Real Time
نویسندگان
چکیده
Recently, width-based planning methods have been shown to yield state-of-the-art results in the Atari 2600 video games. For this, the states were associated with the (RAM) memory states of the simulator. In this work, we consider the same planning problem but using the screen instead. By using the same visual inputs, the planning results can be compared with those of humans and learning methods. We show that the planning approach, out of the box and without training, results in scores that compare well with those obtained by humans and learning methods, and moreover, by developing an episodic, rollout version of the IW(k) algorithm, we show that such scores can be obtained in almost real time.
منابع مشابه
Real-time detection of wildlife using NOAA/AVHRR data Study area :(Kayamaki Wildlife Refuge)
Forest fire in recent years has paid great attention to climate change and ecosystems. Remote sensing is a quick and inexpensive way to detect and monitor forest fires on a large scale. The purpose of this study was to identify forest and rangeland fire hazards using NOAA / AVHRR in Kayamaki Wildlife Refuge. For the purpose of this study, the history of the fire-burns occurred in MODIS products...
متن کاملجداسازی طیفی با استفاده از الگوریتم HYCA بهبودیافته
Hyperspectral (HS) imaging is a significant tool in remote sensing applications. HS sensors measure the reflected light from the surface of objects in hundreds or thousands of spectral bands, called HS images. Increasing the number of these bands produces huge data, which have to be transmitted to a terrestrial station for further processing. In some applications, HS images have to be sent inst...
متن کاملیک روش جدید افزایش دقت مکانی تصاویر سنجش از دور با استفاده از جدول جستجو
Different methods have been proposed to increase the image spatial resolution by mixed pixels decomposition. These methods can be divided into two groups. Some research have been attempted to obtain percentages of sub pixels and the other try to obtain their locations. These methods and their problems will be examined in this study. Common methods are reviewed with more emphasis. Finally, a new...
متن کاملBuilding a Multi-Objective Model for Multi-Product Multi-Period Production Planning with Controllable Processing Times: A Real Case Problem
Model building is a fragile and complex process especially in the context of real cases. Each real case problem has its own characteristics with new concepts and conditions. A correct model should have some essential characteristics such as: being compatible with real conditions, being of sufficient accuracy, being logically traceable and etc. This paper discusses how to build an efficient mode...
متن کاملProper integration time of polarization signals of internetwork regions using Sunrise/IMaX data
Distribution of magnetic fields in the quiet-Sun internetwork areas has been affected by weak polarization (in particular Stokes Q and U) signals. To improve the signal-to-noise ratio (SNR) of the weak polarization signals, several approaches, including temporal integrations, have been proposed in the literature. In this study, we aim to investigate a proper temporal-integration time with which...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1801.03354 شماره
صفحات -
تاریخ انتشار 2018